Font Distribution Observation by Network-Based Analysis

Authors

  • Chihiro Nakamoto
  • Rong Huang
  • Sota Koizumi
  • Ryosuke Ishida
  • Yaokai Feng
  • Seiichi Uchida
Abstract

Off-the-shelf Optical Character Recognition (OCR) engines perform poorly on the decorative characters that commonly appear in natural scenes, for example on signboards. A reasonable step towards so-called camera-based OCR is to collect a large-scale font set and analyze the distribution of font samples, with the aim of building a character recognition engine that tolerates variations in font shape. This paper addresses font distribution analysis by means of a network. A Minimum Spanning Tree (MST) is used to construct the font network with respect to the Chamfer distance. After clustering, a centrality criterion (closeness centrality, eccentricity centrality, or betweenness centrality) is applied to extract typical font samples. The network structure makes it possible to observe the font shape transition between any two samples, which is useful for creating new fonts and for recognizing unseen decorative characters. Moreover, unlike Principal Component Analysis (PCA), the font network visualizes the distribution by measuring the dissimilarity between samples rather than by lossy dimensionality reduction. Compared with the K-means algorithm, network-based clustering is able to preserve small font clusters, which generally consist of samples with unusual appearances. Experiments demonstrate that the proposed network-based analysis is an effective way to grasp the font distribution and thus provides helpful information for decorative character recognition.
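The pipeline named in the abstract (Chamfer distance, MST, centrality) can be illustrated with a minimal sketch. It is not the authors' implementation: the toy glyphs, the averaged symmetric form of the Chamfer distance, the use of Prim's algorithm, and all function names are assumptions made for illustration. The clustering step, eccentricity centrality, and betweenness centrality are omitted.

```python
import math
from collections import defaultdict


def chamfer(a, b):
    """Averaged symmetric Chamfer distance between two non-empty 2-D point sets.

    Assumption: the abstract does not say which Chamfer variant is used.
    """
    def directed(p, q):
        # Mean distance from each point of p to its nearest point in q.
        return sum(min(math.dist(u, v) for v in q) for u in p) / len(p)
    return 0.5 * (directed(a, b) + directed(b, a))


def minimum_spanning_tree(dist):
    """Prim's algorithm on a dense symmetric distance matrix.

    Returns the tree as a list of (i, j, weight) edges.
    """
    n = len(dist)
    in_tree = [False] * n
    best = [math.inf] * n   # cheapest known edge connecting each node to the tree
    parent = [-1] * n
    best[0] = 0.0
    edges = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((parent[u], u, dist[parent[u]][u]))
        for v in range(n):
            if not in_tree[v] and dist[u][v] < best[v]:
                best[v] = dist[u][v]
                parent[v] = u
    return edges


def closeness_centrality(n, edges):
    """Closeness of each node: (n - 1) / sum of weighted tree-path lengths."""
    adj = defaultdict(list)
    for i, j, w in edges:
        adj[i].append((j, w))
        adj[j].append((i, w))
    scores = []
    for src in range(n):
        total, stack, seen = 0.0, [(src, 0.0)], {src}
        while stack:
            # Paths in a tree are unique, so d is the exact path length from src.
            node, d = stack.pop()
            total += d
            for nxt, w in adj[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append((nxt, d + w))
        scores.append((n - 1) / total if total > 0 else 0.0)
    return scores


# Hypothetical toy "fonts": one vertical stroke shifted right by 0..4 pixels.
glyphs = [[(float(shift), float(y)) for y in range(5)] for shift in range(5)]
dist = [[chamfer(a, b) for b in glyphs] for a in glyphs]
edges = minimum_spanning_tree(dist)
scores = closeness_centrality(len(glyphs), edges)
typical = max(range(len(glyphs)), key=lambda i: scores[i])
print("MST edges:", edges)
print("Most central (typical) sample index:", typical)
```

In this toy set the MST is a chain of neighbouring shifts, and the sample in the middle of the chain has the highest closeness, which mirrors the idea of picking a typical sample from the network. Walking along the tree path between two samples gives the kind of shape transition the abstract mentions.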

Related resources

Optimization of Hydrogen Distribution Network by Imperialist Competitive Algorithm

In modern refineries, hydrogen is widely used for the production of clean fuels. In this paper, a new method is presented in order to use ...

Analysis of Bacterial Contaminant in Pasir Gudang, Johor Tap Water Supply–Varies pH Value Observation

Breakthrough pathogenic activity in water distribution network systems is increasing day by day, especially at the point of consumption. Bacterial growth or survival rate often relates to the acidity and alkalinity of water. Sudden changes in pH value and temperature indicate the possible presence of bacterial contaminants in an aqueous environment. The observation of pH- and temper...

Font Recognition of Chinese Character Based on Multi-Scale Wavelet

Research on optical character recognition systems has achieved great success, but layout reconstruction requires the fonts of the characters. In this paper, a novel font recognition algorithm is proposed, which is based on multi-scale wavelet analysis. We adopt wavelet analysis and the grid method to process the character image, extract a wavelet energy density feature, and apply the BP ...

Artificial neural network to predict the health risk caused by whole body vibration of mining trucks

Drivers of mining trucks are exposed to whole-body vibrations (WBV) and shocks during the various working cycles. These exposures adversely influence health, c...

Modeling and Spatio-Temporal Analysis of the Distribution of O3 in Tehran City Based on Neural Network and Spatial Analysis in GIS Environment

Air pollution is one of the most serious problems that people face today in metropolitan areas. Suspended particulates, carbon monoxide, sulfur dioxide, ozone and nitrogen dioxide are the five major air pollutants that pose many problems to human health. The goal of this study is to propose a spatial approach for estimating and analyzing the spatial and temporal distribution of ozone based on ...

Publication date: 2013